An efficient algorithm for tree pattern query minimization under broad integrity constraints
نویسنده
چکیده
Purpose – Tree pattern is at the core of XML queries. The tree patterns in XML queries typically contain redundancies, especially when broad integrity constraints (ICs) are present and considered. Apparently, tree pattern minimization has great significance for efficient XML query processing. Although various minimization schemes/algorithms have been proposed, none of them can exploit broad ICs for thoroughly minimizing the tree patterns in XML queries. The purpose of this research is to develop an innovative minimization scheme and provide a novel implementation algorithm. Design/methodology/approach – Query augmentation/expansion was taken as a necessary first-step by most prior approaches to acquire XML query pattern minimization under the presence of certain ICs. The adopted augmentation/expansion is also the curse for the typical O(n) time-complexity of the proposed algorithms. This article presents an innovative approach called allying to effectively circumvent the otherwise necessary augmentation step and to retain the time complexity of the implementation algorithm within the optimal, i.e., O(n). Meanwhile, the graph simulation concept is adapted and generalized to a three-tier definition scheme so that broader ICs are incorporated. Findings – The innovative allying minimization approach is identified and an effective implementation algorithm named AlliedMinimize is developed. This algorithm is both runtime optimal – taking O(n) time – and most powerful in terms of the broadness of constraints it can exploit for XML query pattern minimization. Experimental study confirms the validity of the proposed approach and algorithm. Research limitations/implications – Though the algorithm AlliedMinimize is so far the most powerful XML query pattern minimization algorithm, it does not incorporate all potential ICs existing in the context of XML. Effectively integrating this innovative minimization scheme into a full-fledged XML query optimizer remains to be investigated in the future. Practical implications – In practice, Allying and AlliedMinimize can be used to achieve a kind of quick optimization for XML queries via fast minimization of the tree patterns involved in XML queries under broad ICs. Originality/value – This article presents a novel scheme and an efficient algorithm for XML query pattern minimization under broad ICs.
منابع مشابه
Minimization of XML Tree Pattern Queries in the Presence of Integrity Constraints
In this paper, we provide a polynomial-time tree pattern query minimization algorithm whose efficiency stems from two key observations: (i) Inherent redundant “components” usually exist inside the rudimentary query provided by the user. (ii) Irredundant nodes may become redundant when constraints such as co-occurrence and required child/descendant are given. We show the result that the algorith...
متن کاملEfficient Evaluation of Generalized Tree-Pattern Queries with Same-Path Constraints
Querying XML data is based on the specification of structural patterns which in practice are formulated using XPath. Usually, these structural patterns are in the form of trees (Tree-Pattern Queries – TPQs). Requirements for flexible querying of XML data including XML data from scientific applications have motivated recently the introduction of query languages that are more general and flexible...
متن کاملAdding Logical Operators to Tree Pattern Queries on Graph-Structured Data
As data are increasingly modeled as graphs for expressing complex relationships, the tree pattern query on graph-structured data becomes an important type of queries in real-world applications. Most practical query languages, such as XQuery and SPARQL, support logical expressions using logical-AND/OR/NOT operators to define structural constraints of tree patterns. In this paper, (1) we propose ...
متن کاملAn Effective Algorithm for XML Tree Pattern Matching and Pattern Tree Minimization
IJPRES Abstract— The extensible markup language XML has recently to come into read as a new standard for data representation and exchange on the internet. With XML becoming present language for data ability functions in varied domains, efficiently querying XML data is a critical issue. XML has become a follow normal to store, share and exchange business data across similar and dissimilar platfo...
متن کاملNon-zero probability of nearest neighbor searching
Nearest Neighbor (NN) searching is a challenging problem in data management and has been widely studied in data mining, pattern recognition and computational geometry. The goal of NN searching is efficiently reporting the nearest data to a given object as a query. In most of the studies both the data and query are assumed to be precise, however, due to the real applications of NN searching, suc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJWIS
دوره 3 شماره
صفحات -
تاریخ انتشار 2007